Limiting behavior of the distance of a random walk 
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Abstract 

This investigation is motivated by a result we proved recently for the random trans- 

Pm ■ position random walk: the distance from the starting point of the walk has a phase 

transition from a linear regime to a sublinear regime at time n/2. Here, we study three 

T^ ' new examples. It is trivial that the distance for random walk on the hypercube is smooth 

C^ ' and is given by one simple formula. In the case of random adjacent transpositions, we 

find that there is no phase transition even though the distance has different scalings 

in three different regimes. In the case of a random 3-regular graph, there is a phase 

transition from linear growth to a constant equal to the diameter of the graph, at time 

>: Slogan. 
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1 Introduction 

Let X" be the continuous time random transposition random walk on n markers. This 
means that at rate 1, we change the current permutation by performing a transposition 
of two randomly chosen elements. Let Z?" be the distance of X" from its starting point, 
i.e., the minimal number of transpositions necessary to change X" into Xq. The main 
result of Berestycki and Durrett |3] is that D^ has a phase transition at time n/2 as 
n — > oo. Writing — >p for convergence in probability. 

Theorem 0. Let t > 0. As n ^ oo n~^D'^^ ^p f{t) where f(t) is defined by: 

f(t) = l' for t< 1/2 

^^ V-ET=ih'-^i^t^~'')'<t fort>l/2 

Having seen this result, it is natural to ask in what situations is the asymptotic 
behavior of the distance from the starting point non smooth. We begin with a trivial 
example. 

1.1 Random walk on the hypercube 

Let X" be the random walk on the hypercube {0, 1}" that jumps at rate 1, and when 
it jumps the value of one randomly chosen coordinate is changed. By considering a 
version of the chain that jumps at rate 2, and when it jumps the new coordinate takes 
on a value chosen at random from {0, 1} it is easy to see that, when n = 1, 

Po{Xl = l) = {l-e-'')/2 

Let Df be the distance from X" to Xq, i.e., the number of coordinates that disagree. 
Since the coordinates in the continuous time change are independent it follows easily 
from this that 

Theorem 1. As n ^ oo, n^^D'^i — > (1 — e^^*)/2 in probability. 

1.2 Random adjacent transpositions 

Let X^ be the continuous time random adjacent transposition random walk on n mark- 
ers. Here we are thinking of Xt{j) as the location of particle j, but the dynamics are 
easier to formulate in terms of Yt{i) = X^ (i) which is the particle at location i. At 
rate 1, we change the permutation by picking l<i<n — lat random and exchanging 
the values of 5^"(i) and l^"(i + 1). Without loss of generality we can suppose Xq is 
the identity permutation /. The distance from a permutation a to I, i.e., the mini- 
mum number of adjacent transpositions needed to build a, is given by the following 
convenient formula 

dadj(o-) = hiv(o-) := #{1 < i < i < n : a{i) > a{j)} (1) 



Inv(cr) is called the number of inversions of a. If we view the set of permutations 5„ of 
{1, . . . , n} as a graph where there is an edge between a and a' if and only if a' can be 
obtained from a by performing an adjacent transposition (in the sense defined above), 
or vice- versa, then Xt has the law of simple random walk on this graph and dadj(^t) is 
the length induced by the graph distance of the shortest path between the current state 
of the walk, Xt, and its starting point, the identity. 

Erikkson et al. JJ and later Eriksen JOl considered the problem of evaluating the 
distance for the discrete time chain XJ^. Relying heavily on formula (^ they were able 
to carry some explicit combinatorial analysis, to obtain various exact formulae for this 
expected distance, such as this one: 






n' 

r=0 



^^>'a + 4*(^ 



(2) 



where C^ are the Catalan numbers and dr is a less famous non-negative integer sequence, 
which they define explicitly. 

While this formula is exact, it is far from obvious how to extract useful asymptotics 
from it. We will take a probabilistic approach based on the formula 

Dt = dadj(^r) = 22 ^{xr{i)>x^{j)} 

i<j 

If 1 < i < n is fixed, the trajectory X^{i) of the i*^ particle is a continuous time simple 
random walk on {1, . . . ,n} starting at i with jumps at rate 2/(n — 1) and reflecting 
boundaries at 1 and n that cause the particle to stay put with probability 1/2. 

Two such trajectories, say those of particles i and j with i < j, move by the nearest 
neighbor stirring process on {1, . . . ,n} (which for indistinguishable particles produces 
the simple exclusion process). When the particles are not adjacent, they perform in- 
dependent simple random walks. When they are adjacent, the only things that can 
happen are an exchange of the two particles, or one of them moves away from the other. 
As the reader can probably guess, and Durrett and Neuhauser [H] have proved on Z, 
when n is large the random walks behave as if they are independent. 

At small times, the behavior of the distance is messy but smooth. For an integer 
X > 0, let T^ denote the hitting time of the level x by a rate 4 random walk on Z 
starting at 0. Let {Y{t),t > 0} and {Y'{t),t > 0} be moved by random stirring on Z, 
with 1^(0) = and ^'(0) = 1 and let p{u) be the probability that at time u the two 
particles are exchanged, i.e., p{u) = P[Y{u) > Y'{u)], and note that this is the same as 
requiring the particles to have been exchanged an odd number of times. For all i > 0, 
let 

fit) := Y^ / P[T^ E ds]p{t - s) (3) 

~, Jo 
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Theorem 2. Let t > 0. Then n D^^ — >p f{t) as n ^ oo where f is the function defined 
by 1^. f{t) is infinitely differentiahle, and moreover it has the asymptotic behavior 

where Bt is a standard Brownian motion. 
To check the constant recall that 

P { max ^4^ = 2P{B^ > x) 

\0<s<l J 

SO integrating gives 
1 
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-E max Bis] = / PiBi > x)dx = EBJ = 2EBJ 
2 Vo<s<i I Jq 

2 f°° ^2 , [2 
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The next result looks at the distance of the random walk at times of order n'^, i.e., 



when each particle has moved of order n times, and hence has a significant probability of 
hitting a boundary. Let pt{u, v) denotes the transition function of B, a one-dimensional 
Brownian motion run at speed 2 reflecting at and 1. 

Theorem 3. Let t > 0. 

\Kh^p [ du [ dv [ pt{u,x)dx I ptiv,y)dy = P[Bi{t) > B2{t)] 
n Jo Ju Jo Jo 

where Bi and B2 are independent copies of B started uniformly onO < Bi{0) < i?2(0) < 
1 evolving independently. 

In between the two extremes we have a simple behavior. 

Theorem 4. Let s = s{n) with s — > 00 and sjv? -^ 0. Then 



ns P ,, 

n\/s V vr 



Recently, Angel et al. ^ have also used the simple exclusion process to analyze a process 
on the Cayley graph of the symmetric group generated by adjacent transpositions, but 
this time in the context of sorting networks. 



1.3 Random walk on a random 3-regular graph 

A 3-regular graph is a graph where ah vertices have degree equal to 3. To construct a 
random 3-regular graph we suppose n is even, and use the approach of Bollobas and de 
la Vega IS] (see also Bollobas jH|). Expand each vertex i into 3 "mini- vertices" 3i, 3i + l 
and 3i + 2, and consider a random matching a{j) of the 3n mini- vertices. A random 
3-regular graph Gn is then obtained by collapsing back the n groups of 3 mini- vertices 
into n vertices while keeping the edges from the random matching. We may end up with 
self-loops or multiple edges, but with a probability that is positive asymptotically, we 
do not, so the reader who wants a neat graph can condition on the absence of self- loops 
and multi-edges. 

Departing from our choices in the previous example, we consider the discrete time 
random walk X^, k >0, that jumps from j to [o"(3j + i)/3] where i is chosen at random 
from {0,1,2}. (We have used this definition since it works if there are self-loops or 
multiple edges.) Let D^ be the distance from the starting point at time t. 



' mm - , 1 



Theorem 5. For fixed t > 



logsn V3' 

An intuitive description of a random 3-regular graph, as seen from vertex 1, can be 
given as follows. Grow the graph by successively adding vertices adjacent to the current 
set. Branching process estimates will show that as long as the number of vertices 
investigated is 0(n^~'^), this portion of the graph looks very much like a regular tree 
in which each vertex has 2 edges going away from the root and 1 leading back towards 
the root. Thus, until the distance of Xn from Xq is > (1 — e) log2 n, D^ evolves like a 
(2/3,1/3) biased random walk on the nonnegative integers, with transition probabilities 
p{x, X + 1) = 2/3 and p{x, x — 1) = 1/3, and reflection at 0. After k moves we expect 
this walk to be at distance k/3. On the other hand, once the walk reaches a distance 
corresponding to the diameter of the graph, log2 n by Bollobas and de la Vega [H] , or 
Theorem 2.13 in Worwald J3]i it should remain at this level. Indeed, it cannot go any 
further, since this is the diameter. On the other hand the tree structure below makes 
it hard for it to come down back toward the root. 

Open Problem. The techniques developed for the random walk on a 3-regular graph 
should be useful when dealing with random walk on the giant cluster of a Erdos-Renyi 
random graph with p = c/n and c > 1, which locally has the geometry of a "Poisson 
mean c Galton- Watson tree". We conjecture that the random walk exhibits a phase 
transition like the one in Theorem but with a different constants in place of 3 and 
1 on the right-hand side. One technical problem is that the diameter is strictly larger 
than the average distance between points log n/ (log c), see Chung and Lu |Ij, so we 
don't have the easy upper bound. 



1.4 Other random walks 

We view the analysis of the examples above as the start of a more complete investiga- 
tion of what are the possible behaviors for the distance of a random walk. There are 
many other interesting examples to consider. For instance, Fulman ^2] has studied the 
evolution of the distance for the Gilbert-Shannon- Reeds riffle shuffle. For this shuffling 
method (which, we emphasize, is nonreversible), Bayer and Diaconis |2] proved an ex- 
plicit formula for the distribution of a after r shuffles. In particular from this formula 
it follows that the riffle-shuffle distance of a permutation is given by 

d{a) = [log2(Des(cr) + 1)] where Des(cr) = #{1 < i < n - 1 : a{i) > a{i + 1)} 

Des(<T) is called the number of descents of a. The main result of Fulman ^^ is 

Theorem 6. After r = log2(an) shuffles 

-E{Des{ar)) -^ a ^ 



n ' ' '' eV" -1 

at least if a > 1/{2tt). 

This says that for this range of r the walk is already in a sublinear regime. In 
particular, as q — > oo we get that d{ar) ~ log2 n since the expression 

1 1 

eV« - 1 1 + (l/a)/2 + • • • ' 

This is not surprising since this is the diameter of the graph. It is not clear at this point 
whether this formula also holds for smaller values of a, although it is tempting to let 
a — > and get that for small values of a the walk is "almost" linear (the fraction term 
with the exponential is much smaller than the other term). 

2 Random adjacent transpositions 

Let Xt, which we write for now on without the superscript n, to be the continuous time 
walk on permutations of {1, 2, ... n} in which at rate 1 we pick a random 1 < i < n — 1 
and exchange the values of Xt{i) and Xt{i + 1). As indicated in (^ the distance from 
a permutation a to the identity is dadj(o") = #{* < J '■ <7(i) > cr{j)}, the number of 
inversions of a. 

2.1 SmaU times 

The reflecting boundaries at 1 and n are annoying complications, so the first thing we 
will do is get rid of them. To do this and to prepare for the variance estimate we will 
show that if i < j are far apart then the probability Xt{i) > Xt{j) is small enough to 
be ignored. Let P^"-'"' be the probabilities for the stirring process with reflection at a 
and b, with no superscript meaning no reflection. 



Lemma 1. p[i'-l(X„i(i) > Xnt{j)) < 8P(X„i(0) > {j - i)/2) 
Proof. A simple coupling shows 

p[i'"](X„i(i) > Xntij)) < P[°'^-*](X,(0) > Xsij - i) for some s < nt) 

< 2p[0'-)(max X,(0)>(j-i)/2) 

\\)<s<nt J 

Using symmetry and then the reflection principle, the last quantity is 

<AP( max X,(0) > {j - i)/2\ < 8P{Xnt{0) > (j - i)/2) 

\0<s<nt I 

which completes the proof. D 

Since the random walk on time scale nt moves at rate 2, 

Eexp(eX„,(0)) = f;e-2tM: (^^±^) = exp(-2t + t{e' + e"^)) 
fc=o ■ ^ ' 

Using Chebyshev's inequality, if > 

P(X„i(0) > x) < exp(-ex + t[e^ + e"^ - 2]) (4) 

Taking = \ 

PiXntiO) > x)< Cte-'' where Ct = exp((e + e"'^ - 2)t) 

When X = 31ogn the right-hand side is Ctn~^, so using Lemma ^ for fixed t it 
suffices to consider "close pairs" with < j — z < 61ogn. The number of close pairs 
with z < 3 log n or j > n — 3 log n is < 36 log n, so we can ignore these as well, and the 
large deviations result implies that it is enough to consider random stirring on Z. 

We are now ready to prove the first conclusion in Theorem[2| if t > then as n ^ oo 



1 ^ ft 

-Dl, ^ f{t) = J2 / PIT"" G ds]pit - s) 



(5) 



Proof of ^. It is clear from the Markov property that if Xt{i) and Xt{j) are moved by 
stirring on Z then 



P{Xt{i) > Xtij)) = [ P[T^-' G ds]p{t - s 
Jo 





With the large deviations bound in (^J giving us domination we can pass to the limit 
to conclude 

1 °° /■* 



To prove convergence in probability let 

6j = hx,(i)>x,m - PiXtii) > Xtij)) 

By remarks above it suffices to consider the sum over 1 < i < j < n with < j — z < 
61ogn, i > 31ogn and j < n — 31ogn which we denote by T,* , and if i' > j + 61ogn 
then 

since the random variables have |^| < 1 and will be independent unless some random 
walk moves by more than 3 log n in the wrong direction. From this it follows that 

E {^*Ci,jf < n • (61ogn)^ + 4Ctn''\n • 61ogn)^ 

and the result follows from Chebyshev's inequality. D 

The remaining detail is to show that / is smooth and that as t ^> oo, 



hm f{t)/Vi = \e (max ^4.) 

t-^00 Z \0<s<l / 



(6) 



where B. is a standard Brownian Motion. The fact that / is infinitely differentiable 
follows easily from repeated use of Lebesgue's theorem and the fact that both p{u) 
and dP(T^ G du)/du are infinitely differentiable smooth functions. This is itself easily 
checked: for instance, if qj is the probability that a simple random walk in discrete time 
started at hits x in j steps, then T^ = ^°^^ gjGamma(j, 4), so T^ has a smooth 
density. A similar argument also applies for the function p{u). 

Proof of ©. The result follows easily from two simple lemmas. 
Lemma 2. p{t) -^ 1/2 as t ^ 00. 

Proof. Each time there is jump when the particles Y and Y' are adjacent, they have a 
probability 1/3 of being exchanged the next step. So, conditionally on the number of 
such jumps N , the number of actual swaps between Y and Y' is Binomial(A^, 1/3). Now, 
y > y' if and only if the number of times they are swapped is odd. Hence the lemma 
follows from the two observations : (i) As t ^ 00, the number of jumps while they 
are adjacent to each other — > 00, and (ii) as A^ ^ 00, P[Binomial(A^,p) is odd] — > 1/2 
for any given < p < 1. For (i), observe that the discrete-time chain derived from 
{|1^ — Yj-'l — 1, t > 0} is a reflecting random walk on {0, 1, . . .}, and therefore visits 
infinitely many times, (ii) is an easy fact for Bernoulli random variables. D 

Lemma 3. 

^ 00 

—J2p[T-^{t-\ogt,t)]^G 



Proof. The random walk can only hit a new point when it jumps so 



t E [ y^ l^T'^^(^t-logt,t)} ] ^ t -^'(T^jumps of the random walk in(i — logt, t)) 

< t^^/2-(41ogt) ^0 
since jumps occur at rate 4. D 

It is now straightforward to complete the proof. Let e > 0. Fix T large enough so 
that \p{t) — 1/2| < e as soon as t > T. Then by Lemma|31 for t>T':= e^ , letting Wt 
be a simple random walk on Z in continuous time jumping at rate 1, 



^^ rt—iogt 

t-'/^f{t) = t-^'^Y. P[T^&ds]p{t-s)+o{l) 

- {\ + ^)i^"^flP[T'' <t-\ogt\+o{l) 

^ ^ x=l 

< (- + e]t~^^^y"p( max W^s > x] 
\2 J ^ \s<t^\ost J 

- + e] E maxB^s- 
2 J s<i 

by Donsker's theorem. The other direction liminft_>oo i~^/(0 ^ (1/2— e)-EmaXs<i i?4s 
can be proved in the same way. D 

2.2 Large times 

Our next goal is to prove that if t > then 

^L»;j3f -^ [ du [ dv I ptiu,x)dx [ ptiv,y)dy = P[Bi{t) > B2{t)] (7) 

n Jq Ju Jo Jo 

in probability and where Bi and B2 are two reflecting Brownian motions run at speed 
2 started uniformly on < ^i(O) < ^2(0) < 1 and evolving independently. 

Proof. We first show that the expected value converges. The first step is to observe that 
the rescaled random walks X^-n{i)/n, t >0 converge to reflecting Brownian Motion on 
[0, 1]. Indeed, Durrett and Neuhauser (2.8)] showed that for fixed i < j, the rescaled 
pair of random walks converge to two independent Brownian Motions. They did this 
on Z but the proof extends in a straightforward way to the current setting. Their proof 
shows that if i/n -^ x and j/n — > y we have 

P[XMt) > X„3,(j)] - P.,y[Blit) > Mt)] 



This implies that the convergence occurs uniformly on the compact set so 



1 



n 



^EKh 



t<j 



stii) > XMJ)] ^ P[Biit) > B2it)] 



To get the convergence in probability, we use second moment estimates. Let Aij 
{X„3i(i) > X^BtiJ)}. 



«!>:.. 



;^EE^[-^« 



n^ 



kM 



i<j k<l 



The first step is to observe that there are only 0{'n?) terms in which two of the indices are 
equal so these can be ignored. When the four indices are distinct we can again apply Dur- 
rett and Neuhauser's [HI results to the 4-tuple of random walks {X{i),X{j),X{k), X{1)), 
to conclude that if i/n -^ x, j/n -^ y, k/n — > x' and l/n — > y' 

P[A,^,nAk,i] ^ P,,y[Bi{t) > B2{t)]P,,' ^y'[Bi{t) > B2it)] 

From this it follows that 



E 



1 



DO:, 



n 



2 n'^t 










In other words, the variance of n D^^^ is asymptotically 0, and applying Chebyshev's 
inequality, we get the convergence in probability to the limit of the means. D 

2.3 Intermediate regime 

The proof of Theorem 0] is a hybrid of the two previous proofs. We first truncate to 
show that it suffices to consider i < j close together and far from the ends, then we 
compute second moments. We begin with a large deviations result: 

Lemma 4. For all x > and t > then 

P{Xnt{0) > x)< exp(-xV8ei) + exp(-xln(2) - 2t) 



Proof. First assume x < 4et. From ^ we have P{Xnt{0) > x) < ex.p{—0x + t[e^ + 
e-^ - 2]). When < 6* < 1 



e" + e- 



< 



2"^ 4[^ 6[^ 
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and by continuity this is valid also when = 1. Taking 9 = x/Aet which is < 1 by 
assumption 



P(X.,(0)>.)<exp(-^ + t.l(^)^)<exp 



(-xVSet) (8) 

When X > 4ei, remark that P{Xnt{0) > x) is smaller than the probability that a Poisson 
random variable with mean 2i is greater than x. Thus for any ^ > this is by Markov's 
inequality smaller than exp(— 0x + 2t(e^ — 1)). This is optimal when e^ = x/2t, in which 
case we find that 

P{Xnt{0) >x) < exp(-x ln(x/2t) + x - 2t) < exp{-x ln(2) - 2t) (9) 

since x > 4et. Equations ^ and © give us two bounds valid in different regions, 
so by summing them we get a bound that is everywhere valid, and this concludes the 
proof. n 

Proof of Theorem^ By assumption we can pick Kn — > oo so that K^^/s/n — > 0. By 
Lemma ^ 



1 1 °° 

— V p[^'"l(^n.(i)>X„,(i)) < 8^ V P{Xns{0)>x/2) 

P(X„,(0)> \x^sl2\)dx 



i,j>i+Kn^/s X=Kn^ 



K„ 



Applying Lemma |1] it follows that 



Uy/S ^-^ 

i,j>i+Kn\/s 

Letting /jj be the indicator of {Xns{i) > Xns{j)} it follows that 

— — \ /j ,• ^ in probability 

riy/s ^-^ 

i.e., we can restrict our attention to close pairs. Once we do this, we can eliminate ones 
near the ends since 



riyjs ^-^ n\/s 



by assumption. In a similar way we can eliminate j > n — Kn\fs. 

It follows that it is enough to consider random stirring on Z. The result of Durrett 
and Neuhauser |H| implies that if s — > oo, i > Kn\fs., j < n — Kn\fs and (j — i)/ y/s — > x 
then 



EL i -^ -P I max i?4i > x ] 
''^ 2 \o<t<i J 
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where the right-hand side is if x = oo. Writing S* again for the i,j with i > Kn^/s, 

j < n — Kn\/s and < j — i < Kn\fs, and using the domination that comes from 

Lemma lU it fohows that 

11 

zTi*EIij — > —E max. B^t 



ny/s '■' 2 o<t<i 

The next step is to compute the second moment. The number of terms with one 
index \n i < j equal to one oi k < I with both pairs close is < n{Kny/s)'^ , which when 
divided by (ny^)^ tends to 0. The result of Durrett and Neuhauser jH] implies that 
terms in which all four indices are different are asymptotically uncorrelated. We remark 
that in Lemmal^we can also get an upper-bound on P{Xnt{0) > x)^'^ by summing the 
square-roots of the two terms in Q and @ since only one of them applies in a given 
region. This and Cauchy-Schwartz's inequality provide the justification for the passage 
to the limit: 



:^i<jK<eE{Ii,jIk,e) -^ ( -^ max fi4ij 



and the rest of the argument is the same as in Theorem |3J D 

3 Random walk on a random 3-regular graph 

Let Gn be a random 3-regular graph constructed as in the introduction and let Xk be 
the discrete time random walk on Gn, where for simplicity we drop both the superscript 
n and the hat to indicate discrete time. We assume that Xq = 1 and write D^ for the 
graph distance from X^ to Xq. Our goal is to prove Theorem 5, that is, for fixed t > 



log2n \3' 

3.1 Proof for the subcritical regime 

Let u be a vertex at distance I from the root. We say that v is a "good" vertex if it 
has two edges leading away from the root (at distance / -|- 1) and one leading back to 
distance l — l. Otherwise we say that f is a "bad" vertex. Let B{1) be the set of all bad 
vertices at distance /. 

Lemma 5. Let 2 < v < n be a vertex distinct from the root. Given that v is at distance 
I from the root, P{v G B{1)) < 2i/n where i = 2 . 

Proof. First consider the event that v has an edge leading to some other vertex at 
distance /. Since it is at distance /, it must have at least one edge leading backwards, 
so there are only two other edges left. In particular there are at most 2' = i vertices at 
distance /. In G„ those i vertices at distance I correspond to 2i unpaired mini- vertices, 
so the probability of a connection sideways to another vertex at distance i is smaller 
than 2i/3n. 



12 



When V has two edges leading forward, the probabihty that one of its children is 
connected to another vertex from level / is also smaller than 2i/3n since there are at most 
2i edges leading to level / + 1. Since v has at most 2 children, this gives a probability of 
at most 4i/3n. Combining this with the estimate above gives 2i/3n + 4i/3n = 2i/n. D 

A simple heuristic now allows us to understand that with high probability the ran- 
dom walk will not encounter any vertices as long as we are in the sub critical regime. 
Before we encounter a bad vertex, the distance is a (2/3,1/3) biased random walk and 
hence spends an average of 2 steps at any level. Hence, the expected number of bad 
vertices encountered until time distance (1 — e) log2 n is smaller than 

(l-e)log2n , 



E 2- = O(n-)^0 



n 
1=1 

To prove this rigorously, let A^ denote the event that by time k the random walk has 
never stepped on a bad vertex up to time k. 

Lemma 6. As n ^ oo, -P(^3(i-£)iog2n) ^ 1- 

On this event, for each 1 < j < 3(1 — e) log2 n, Xj has probability 2/3 to move away 
from the root and 1/3 to move back towards the root, and the first part of Theorem [HI 
follows easily. 

Proof. By Lemma IHl the probability that some vertex within distance L of 1 is bad is 

A ,2-2^ 2 22^ 

< > 2^ < ■ > 

^-^ n n I- 1/4 

£=1 ' 

ifL = (l/3)log2n. 

Since for each vertex there are at most two edges leading out and one leading back, 
the distance from the starting point is bounded above by a (2/3,1/3) biased random 
walk. Standard large deviations arguments imply that there are constants C and a 
depending on p so that 

P{d{Xk) > pk) < Ce-"'' (11) 

Summing from k = L to oo, we see that with high probability d{Ak) < pk for all k > L. 
When this good event occurs for A; > L, it follows from Lemma El that 

/ 2P'^\ -^ / 2P^\ 

p{Ak+i) > p{Ak) (i - 2— j > n ( 1 - 2— ) 

Taking the logarithm, we have for large n 

k 



\ogP{Ak+i) > El°g (1-277) 



L 
k 






> -4 > — > 



23 4 2^P 



n ~ \-2-P n 
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We want to take k = 3(1 — e) log2 n. By choosing p close enough to 1/3 so that 
3p(l — e) < 1, we have 2^P/n = n^" with a > which proves the desired result. D 

3.2 Proof for the supercritical regime 

Here we wish to prove that if A; = tlog2 n, with t > 3(1 — e), then d{Xk) ~ log2 n. As 
already noted, this is the diameter of G„ so all we have to prove is that once it reaches 
this distance it stays there. To do this we let 

L{a,b) := {2 < V < n : d{v) € [alog2n,blog2n]} 

and consider L{1 — £,1 — 5). 

Intuitively, this strip consists of about 7i^~^ trees, each with at most n^~ vertices. 
However, there are sideways connections between these trees so we have to be careful 
in making definitions. Let vi,...,v„i be the m vertices at level (1 — e)log2?^• For 
j = 1, . . . ,7n if V £ L(l — e,l — 5), we say that v S Tj if Vj is the closest vertex to v 
among vi,...,Vm- 

To estimate the number of sideways connections (i.e., edges between vertices v and 
v' in different Tj's), we use: 

Lemma 7. The number of subtrees that Tj is connected to is dominated by a branching 
process with offspring distribution Binomial{n'^~ ,n~ ). 

Proof. Each tree to which we connect requires a bad connection (i.e., one of the two 
possible errors in Lemma [3. Suppose we generate the connections sequentially. The 
upper bound in Lemma [3 holds regardless of what happened earlier in the process, 
so we get an upper-bound by declaring each vertex at level / bad independently with 
probability 2i/n with i = 2\ so this probability is at most n~ . Since there are at most 
n^~ vertices in a given subtree, the lemma follows immediately. D 

Lemma 8. If 6 > e/2 then there exists some K = K{£,6) > such that, 

P (there is a cluster of trees Tj with more than K bad vertices) — > 

Proof. The worst case occurs when each bad connection in a tree leads to a new one. 
Let 

X = B\n{n'~\n~^) 

be the offspring distribution of the branching process of the previous Lemma. In par- 
ticular 

E{X) = 0(n^-25) ^ 

Let c = n'^~ , and let N = n^~ be the total number of vertices in Tj, so X = 
Binomial (A^,c/A^). 
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Lemma |7| follows from a simple evaluation of the tail of the total progeny Z of a 
branching process with offspring distributed as X. To do this, we let 

fc=0 ^ ^ 

be the moment generating function of X — 1. Let Sk be a random walk that takes steps 
with this distribution and Sq = 1. Then r = inf{/c : S'^ = 0} has the same distribution 
as Z. Let Rk = exp{9Sk)/4>N{&)^- Rk is a nonnegative martingale. Stopping at time r 
we have e^ > E{(pN{Oy )■ If <pNi0) < 1 it follows that 

p{r > y)^N{ery < E[^N{ern < e' 

Using 0Af (0) < e^^ exp(c(e^ — 1)) now we have 

P{r >y)<e^ (e-^exp(c(e^ - 1)))^ 

To optimize the bound we want to minimize c(e — 1) — 6. Differentiating this means 
that we want ce^ — 1 = or = — log(c). Plugging this and recalling that r and Z have 
the same distribution we have 

P{Z> y) < -exp(-(c-l-lnc)y) 

Substituting c = n"" with a = 26 — e, we find that 

P{Z >y)< n"exp(y(l - alog(n))) 

Since there are m < n^~^ trees to start with, the probability that one of them has more 
than y trees in its cluster is smaller than 

n ^^n" exp(y(l — a log n)) 

so if 

a + l-e 

y> :=K{6,e) 

a 

then the probability than one cluster contains more than y trees tends to 0. This implies 

that with probability 1 asymptotically, no cluster of trees has more than K bad vertices, 

since the branching process upper-bound is obtained by counting every bad vertex as a 

sideways connection. D 

With Lemma El established the rest is routine. In each cluster of trees there is 
a stretch of vertices of length > alog2ra where a = (e — 5)/{K + 1) with no bad 
vertices. The probability of a downcrossing of such a strip by a (2/3,1/3) random walk 
is < (1/2)" ^°S2" = 7),-<J go the probability of one occurring in n"'''^ time steps tends to 0. 
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